Temporal scaling at Feigenbaum points and non-extensive thermodynamics 
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We show that recent claims for the non-stationary behaviour of the logistic map at the Feigenbaum 
point based on non-extensive thermodynamics are either wrong or can be easily deduced from well- 
known properties of the Feigenbaum attractor. In particular, there is no generalized Pesin identity 
for this system, the existing "proofs" being based on misconceptions about basic notions of ergodic 
theory. In deriving several new scaling laws of the Feigenbaum attractor, thorough use is made of 
its detailed structure, but there is no obvious connection to non-extensive thermodynamics. 



During the last decade has appeared a vast literature 
on a new "non-extensive thermodynamics" (NET), which 
uses a maximum entropy principle with the Shannon en- 
tropy replaced by the Havrda-Charvat [l| ("Tsallis") en- 
tropy |a I! Ilia- Several versions of NET were proposed 
by Tsallis and others in order to avoid inconsisten- 
cies. But as shown most forcefully by Nauenberg 6] , none 
of these versions is consistent with thermodynamics, at 
least for equilibrium systems. 

For non-equilibrium systems the situation is not quite 
so simple, as the standard maximum entropy principle is 
usually not applicable. As shown by Jaynes [3, the for- 
malism of statistical physics can be obtained by Occam's 
razor: If all knowledge can be formulated in terms of 
constraints, then the only rationally justifiable ansatz for 
the probability distribution is the one which maximizes 
Shannon entropy (which has to be replaced by Kullback- 
Leibler information, if some knowledge existed prior to 
these constraints). The main reason why standard ther- 
modynamics cannot be applied to most non-equilibrium 
systems is that prior knowledge cannot be cast into the 
form of a few constraints. But there is no reason for aban- 
doning an information theoretic interpretation of entropy, 
and Shannon (Kullback-Leibler) entropy is the only con- 
sistent probabilistic measure of (relative) information. 

The main reason why NET is still vigorously pursued 
is, it seems, the claim that it is able to make strik- 
ing predictions that could not be made within a more 
conventional framework. Typically, these are for non- 
equilibrium phenomena with distributions showing power 
laws with heavy tails. In deriving these distributions, not 
only is Shannon entropy replaced arbitrarily by Tsallis 
entropy. Also the constraints arc modified in a way which 
has no clear rational motivation - except that one arrives 
thereby at expressions more easily handled. 

A careful study of most - if not all - examples where 
NET was supposedly successful shows that the success is 
much less clear than claimed. In a later publication, we 
will substantiate this further by discussing several such 
examples. In the present letter, we discuss in depth just 
one single example, which has been treated in several 

l2ll 122, 122J , and which was claimed to show the success 
of NET in a particularly clear way. 

This example is the non-stationary behaviour of the 



Feigenbaum attractor 1241 ]. More precisely, one can 
study: (1) sensitive dependence on initial conditions 
(both for finite and infinitesimal perturbations, both on 
the attractor and in its basin of attraction) ; (2) scaling of 
different dynamical (Shannon, Renyi, Tsallis) entropies 
of various ensembles of trajectories with their time length 
T; (3) convergence of a typical trajectory (with random 
initial condition in its vicinity) to the attractor; (4) scal- 
ing of "static" (i.e., Boltzmann-Gibbs) entropies of an 
ensemble of points with time. 

Most of these problems have been discussed since the 
early 1980's [UlSSEllHliil- Also, it had been real- 
ized from the very beginning that some of these questions 
are subtle due to large fluctuations ( "multifractality" of 
the Feigenbaum attractor). In the first papers based on 

NET US E3 

this was missed, leading to wrong claims 
that a single non-extensivity parameter could describe 
scaling at the onset of chaos. Although this was corrected 
recently, we shall see that the effect of fluctuations is still 
not fully appreciated in some of these papers [2H I23I ] . 

In the following we give theoretical arguments only 
for the Feigenbaum map [24| g(x) defined by g(g(x)) 
—a~ 1 g(ax) and g(x) = 1 
we use the logistic map a — x 
for numerical calculations. Problems of universality have 
been discussed e.g. in 

Let us first discuss the dependence on infinitesimal 
changes in the initial conditions, i.e. the behaviour of 
A„(aio) = In \dx n /dxo\, where Xi+i — g{xi). As pointed 
out in 26], A n (xo) fluctuates very strongly with n and 
xq. Different ways of averaging over xq give therefore 
rise to different scalings with n. If we take arithmetic av- 
erages over A„ (i.e., geometric averages over \dx n /dxo\), 
we get 



ex' 2 + 0(x 4 ) for x — > 0, but 
2 with a= 1.401155189... 



dxoWo(xo) In \dx n /dxo\ ~7mn 



(1) 



with 7 = 0.599 ±0.003 for all smooth initial distributions 
Wo(x), at least when we also do an additional averag- 
ing over n to damp out remaining oscillations |3l| . On 
the other hand, as proven rigorously in |26|. arithmetic 
averages over \dx n /dxo\ give 



dxoWo(xo)\dx n /dxo\ ~ ri 



const In 1 



(2) 
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FIG. 1: Scaling of the time averaged local sensitivity expo- 
nents, for trajectories starting at xo = 1. 



Without averaging over xq one can of course obtain 
completely different behaviour. For Xq = 1, e.g., one 
finds \dx n /dxo\ — ► const for n = 2 fc , fc — > oo, while 

|dx„/da; | = a fc = (n+ l) log2 " (3) 

(exactly) for n = 2 k — 1 . Tsallis e£ a/, prefer to 
write Eq.© as |dx n /dx | = (1 + A(l - q)™) 1 ^ 1 -?) with 
A = 1/ (1 — q) = log 2 a, and call the r.h.s. a ^-exponential. 

Notice that Eq.© is not a scaling law, since it holds 
only for special values of n. But one obtains a scaling 
law by taking averages over n (see Fig. 1), 

n 

A n (x ) = n^^A^xo) ~ j3hxn for x = 1 (4) 
j=i 

with /? = 0.71339380 . . .. Notice that the constants in 
Eqs.QJ, ©, and are not directly related. 

Defining A n (x) for any a; on the attractor as in Eq© 
gives the natural generalization of the Lyapunov expo- 
nent. It fluctuates strongly with x I28ll31| and for many 
applications like Pesin's identity [331 ] one needs averages 
over x with respect to the natural measure fJ-(x). In the 
present case, one can easily see that the average 

(A„) ee J dfi(x)A n {x) (5) 

is identically zero for all n J35| , which indicates that there 
cannot exist a close analogy to Pesin's identity. The 
identities suggested and verified in 0, [2^, are triv- 
ialities where the authors started with an initial distri- 
bution wq(x) narrowly localized around xq and followed 
the evolution only up to values of n for which w n (x) is 
still smooth and described by the flow linearized around 
the trajectory starting at xq [23 • m addition they consid- 
ered, instead of the Kolmogorov- Sinai (KS) entropy, just 
the difference S n — Sq between two static "Boltzmann- 
Gibbs" entropies 

S n = - / dx w n (x)\nw n (x). (6) 



In this case one has of course S n —So = A n (x ), but this 
has no connection to any (generalized) Pesin identity. 

Eqs.© and © apply to trajectories starting on the 
attractor. In order to understand the origin of Eqs.© 
and © one has to study how trajectories starting in its 
vicinity approach the attractor. For this one has to use 
the detailed triadic Cantor structure of the Feigenbaum 
attractor [24|. Associated to this structure is a set of 
open disjoint intervals Iks-, the closure of which covers 
the interval [— 1/a, 1] which contains the attractor [2||. 
1(0, 1) is the hole cut out from the middle of the Cantor 
set in the first stage of construction; 1(1, 1) and 1(1, 2) are 
the second generation holes, 1(2, 1) to 1(2, 4) are the holes 
cut out in the third step, etc. Each Iks contains exactly 
one point on the instable periodic orbit of period 2 k . Let 
us define Ik = U< i- e - Ik is all what is cut out during 
the k— th step. The evolution of a typical trajectory can 
then be viewed as a tumbling through the Ifc's, with k 
never decreasing with n. The average increase of k is for 
large n exactly given by 

(k) = const + log 2 n. (7) 

As shown in [2(| , Eq. (|TJ follows (up to the precise value 
of 7) from Eq. J7J) and from the scaling behaviour proven 
by Feigenbaum. 

In [23 the authors studied another average over A n (x), 
in between Eqs.Q and ©, 

£ g (n) ee J dxow (xo)\dx n /dx \ 1 ^ q . (8) 

From straightforward simulations they concluded that 
becomes (asymptotically) linear in n, £ q (n) ~ n, 
for q = 0.36 and called this value q^ n . But neither an- 
alytic nor numerical estimates of £ q (n) seem easy. A di- 
rect numerical estimate as in |20j is prone to large errors, 
since the integrand of Eq.© (with constant wo(xq) and 
for q = q^ n ) is very sharply peaked near the unstable 
periodic orbits of small periods. Choosing initial values 
xq at random one will miss these peaks, unless one has 
extremely high statistics. But it is not clear anyhow why 
Eq.lJHJ should be of interest. In contrast to what its name 
suggests, q^n does not measure the average sensitivity to 
initial conditions but just one particular average. 

Another consequence of Eq.Q is that the (geomet- 
rically) average distance from the attractor of a point 
starting randomly in 1] decreases as |26j 

d n ~ n- 1 '^ (9) 

where D\ = 0.517097 ... is the information dimension of 
the Feigenbaum attractor. 

The same argument can also be immediately used 
to derive the scaling of the "Boltzmann-Gibbs" entropy 
with n, after starting with a random ensemble [23I ]. As- 
sume we have N n points distributed randomly with re- 
spect to w n (x). The distance between point Xn and 
its nearest neighbour is called r,-. The Kozachenko- 
Leonenko estimator [2II for differential entropies gives 
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FIG. 2: Time dependence of the "Boltzmann-Gibbs" en- 
tropy S n , i.e. of the differential Shannon entropy of the x- 
distribution, starting with a uniform distribution over the 
interval [a — a 2 , a]. The three curves correspond to differ- 
ent binnings. The lines connecting the data points are just 
to guide the eye, and omit the lognormal oscillations. The 
straight line indicates the theoretically predicted slope. 



then for large N n 



5,i 



const + In N„ 



E ln ' 



(10) 



To apply this for the present problem, we choose fur- 
thermore N n — uNq so that N n scales as the number of 
intervals Ik,i with k = const + log 2 n. We notice also that 
all Ik,i with i = 1, . . . 2* have roughly the same statisti- 
cal weight. Then the distance to the nearest neighbour 
scales as the size of the interval in which the point is 



located, 



£ \Ik i\ with Xj E lb i, and it follows that 



S n = const — 



l-.Pi 



log 2 



(11) 



Data obtained by spraying N points uniformly onto [a — 
a 2 , a], letting them evolve according to the critical logistic 
map, and estimating S n by binning this interval into M 
bins, are shown in Fig. [3 Here we used N > 2 x 10 9 which 
is largely sufficient for convergence (we used the entropy 
estimator corrections given in Convergence with 

the number of bins is much slower. We see very clear 
changes as we increase M from ss 10 7 to ks 3 x 10 8 . This 
explains also the small remaining discrepancy. 

Although S n had been discussed as an interesting 
quantity in the NET literature [H EHH H3| , we are 
not aware of a previous estimate. Instead, several au- 
thors [ill UtI |23| have studied a different quantity W n , 
which they supposed to have the same scaling. Instead 
of taking a sum over all non-empty bins with weights 
Pi\og(l/pi), W n is just the number of non-empty bins. 
Scaling of this quantity is much more subtle - both theo- 
retically and numerically. We should expect larger finite- 
n, finite M, and finite N corrections for W n than for S n , 
but when n,M, and N are sufficiently large |40|. we can 



apply basically the same reasoning to W n as we did for 
S n : It scales like the total length of all intervals Ik,i with 
k = log 2 n. Calculating the latter is easy and gives 

W n ~ n-0-800138194... _ (12) 

The exponent measured in ^l| was quite different (0.71 ± 
0.01 instead of 0.80), but this is easily explained by the 
expected large corrections to sca ling , which have even led 
to claims of non-universality in |23J. 

The KS entropy for a one-humped map with maxi- 
mum at x = is obtained from the sequences s n = 
(sign(xi), . . .sign(x„)), as 



H 



KS 



lim n H„ 



(13) 



with H n = —J2 Sn P{ s n)\ogp(s n )- As shown in [271 130| . 
for the Feigenbaum attractor one has H n = logn + 0(l). 
Thus the KS entropy is zero, but there is a logarith- 
mic increase of H n . With the same methods one can 
prove exactly, that all Renyi entropies H q = linin^oo H q 
with H% = (1 — q)~ x \ogJ2 s [p( s n)] q are equal for this 
problem, lim^oo H^/H n = 1 for all a For Tsallis- 
type generalized KS entropies, this gives K q = (1 — 
g) _1 (cxp((l - q)H%) - 1) ~ n 1-9 , at variance with 0. 

The ^-independence of the Renyi entropies is surpris- 
ing in view of the multifractality of the attractor. For a 
chaotic attractor, the H% are closely related, via general- 
izations of Pesin's identity, to q-th moments of A(x) [42| . 
This shows again that the onset of chaos is more subtle 
than expected in the NET literature. 

In the first papers @, EJ El El , it was supposed but 
never substantiated that the parameter q of NET can be 
obtained, also for the Feigenbaum map, by some maxi- 
mum entropy principle. Although this was never with- 
drawn, q is now fixed such that one obtains linear time de- 
pendencies, when logarithms are replaced by q— deformed 
logarithms (see e.g. |2(j)- The reason for this is not clear, 
since there is no need for all time dependencies to be lin- 
ear. But even worse, with the proliferation of different 
(although closely related) scaling laws, one obtains - for 
the single case of the Feigenbaum map! - a rich zoo of 
different q values |j,|20|. With the additional scaling laws 
found in the present paper, there would be even more g's 
- unless one accepts at last that non-extensive thermody- 
namics is (at least at the onset of chaos) just a chimera. 
On the other hand, we consider these new scaling laws 
as an important part of the present letter. 

One reason for preferrin g "g -exponentials" (i.e., gener- 
alized Pareto distributions |T| ) over power laws could be 
that the former give correct deviations from pure power 
laws at small arguments. But in all cases studied in the 
present paper (except Eq.Q) the small-n limits are nei- 
ther given by pure powers nor b y q-exp onentials. 

Finally, as a last remark: In UllSHfl the Feigenbaum 
map was chosen as the prototype of a supposedly weakly 
mixing system. But it is not mixing at all. 

I am indebted to Maya Paczuski, Walter Nadler and 
Karol Zyczkowski for numerous discussions. 
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